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REGULATORY SEQUENCES AND EXPRESSION CASSETTES FOR YEASTS 



5 The invention concerns a DNA sequence which is active as a 

promoter in yeast cells, an expression and optionally secretion system 
containing the DNA sequence, plasmids containing that system, host cells 
that have been transformed with the DNA, and methods of producing 
proteins and polypeptides. 

10 The production of peptides and proteins with gene-technology 

methods has become the usual practice in the meantime and many 
different systems are available for that purpose. Frequently, use is made of 
bacteria systems which however suffer from disadvantages, in particular in 
regard to producing medicaments or vaccines, for example that they 

15 produce pyrogenic substances which have to be removed prior to use, or 
they cannot glycosylate the polypeptides. Therefore, a series of other 
systems for the expression of polypeptides in eukaryotic cells have also 
been developed. An organism which is wide-spread for that purpose is the 
yeast Saccharomyces cerevisiae, the genome of which is known in the 

20 meantime and can be obtained for the vectors and expression systems. 
However the use of that micro-organism also involves disadvantages. Thus 
Saccharomyces is for example temperature-sensitive, which requires 
expensive pieces of equipment for temperature control in culture. 

A genus which is considered for bio-technological methods by virtue 

25 of advantageous properties is the yeast Kluyveromyces. The species K. 
lactis and K. marxianus are classified as GRAS (Generally Recognised As 
Save) and can therefore be used with the same security as Saccharomyces. 
In addition K. marxianus can use a large number of carbon sources and 
energy sources for growth and is not very temperature-sensitive. 

30 Kluyveromyces marxianus can grow at temperatures of up to 45°C and can 
therefore be more easily cultured, in contrast to the temperature-sensitive 
Saccharomyces strains. The cells of rapidly growing K. marxianus strains 
can divide under optimum conditions every 35 minutes. However it was 



hitherto not possible to make optimum use of those good properties as 
reliable, strong-expression variable promoters are lacking for that type of 
yeast and there are scarcely any expression systems available, with which 
proteins can be produced with a high level of effectiveness. 
5 Now, the object of the present invention was to provide a promoter 

which is suitable for expression in yeasts, in particular yeast cells of the 
genus Kluyveromyces, as well as expression systems which can be used 
variably for expression. 

That object is attained in that in accordance with the invention a DNA 
10 sequence which includes the nucleotide sequence in accordance with SEQ 
ID No 1 

GAGGCCTGTC CGATTATTAA ACTTGCGGCA CCCGAGTTTG TGACCTTCGA 
CGACATGTTT TATTTCCACA CCGTAGCTAC ACTTTCTATG TAGTAAGTAG 
GTAGTATGGA TGGTAGCTAG TAGAAACTAA ACGAAACGAA ATAAATGTGA 

15 AATGTTAGAC GTAAAGGGGA GGGGAAGGGA AGGGGGCGGC GGAGAGACAT 
GCCAGCCAT GCCATTTCAT GGCATGGCAT GTCAAGGGAT ACTGCATGCA 
TGCATGCATA CTTTACCAAT AGCAAAGTAA ATTG CTTTCT TCCCCCATTT 
GAAACTATTC CACCTCAATC CATCTTTTCT ATAATGG GTA TCACCGATCT 
CATGTGTTCT AATAATGCTG CAGGCAACAA CAAATCTTAA AGGCAACTTG 

20 GAATGTAATT TGGTTAATGA TAGATATCAA ACAGCAATGG TGGGCTCCAA 
CCGCATGGAT ATGCTCACCT TATTATCCGG AATTGTTGTT CCGCAGGAAA 
AAAAAAAAAC CTCGAACCAG ATATTAATTA TCCTATCATT ACTGCGTACA 
AAACCCGGGA ACGGTTAACC TGCAGCAGCC GTTTTGCTTA CAGTTCTCAT 
GCACAATCAG CCAGATTTTG CAATAGTATT AACTTAGAAT TAAGGCAACA 

25 TCTTTGGATA TGCATGTAGA GTAAGTCGTT CGAAACCATT ATTATTATTA 
TT ATT ATT AT TATTATTATT ATTATTATTA TTATTAGTAT TATTGAAATT 
GTTATTGTTC TTAGTTTCAC TACTATTATT ATTCATATTC ATGTTATTGA 
CATCGCCGAA CGACCAGCCT CCATACCGAT TAGACAGGAT CTCAAACGTG 
GGCTCCAGAG CTCACACATT ATGCTAAATA ACTATCTACT GTAACAGCTA 

30 C AG A A A A AAA ACTATAAAAG AG CG AG GAT AAACCACTCT CTTGTGAATC 
AGGATCAGTA GGTAACTCAT AAACCTTCTT CTTTTCTCTC AAAATATCAA 
ATAACAGTAG TATCAACAAC GATATCGAAT AATACTAACT ACTACAACAG 
TAGGAACAGT AACGACAACG ACAACGATAG TAACGACAAT AACGACACCA 
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% ACAAACAACA GGAACACAGA TTAAGCTCAG AAACAAAAAA AAAAAA 

is provided. 

The nucleotide sequence according to the invention includes a 
sequence which is active as a promoter and furnishes an expression system 
5 which is highly variable and suitable for Kluyveromyces, but can also be 
used for other types of yeast, in particular Saccharomyces cerevisiae. 

The DNA sequence according to the invention, in accordance with 
SEQ ID No 1 is a nucleic acid sequence which contains regulatory regions of 
a gene which codes for the enzyme endopolygalacturonase. The enzyme 

10 endopolygalacturonase breaks down pectin insofar as it cleaves 1,4-a-D- 
galactosituronic bonds between two non-methylated galacturonic acid 
esters. That enzyme occurs inter alia in yeast strains of the type 
Kluyveromyces marxianus. The sequence of the endopolygalacturonase 
gene from Kluyveromyces marxianus var. marxianus was published in Yeast 

15 15, 311-322 (1999). The promoter region which can be derived from the 
stated sequence however did not reliably result in the expression of various 
proteins. 

It has now been found that a sequence which includes a part of the 
nucleotides of SEQ ID No 1, preferably at least the nucleotides 1 to 1134 

20 and in particular the entire sequence permits the expression of proteins in a 
highly advantageous and reproducible fashion. The claimed sequence in 
accordance with SEQ ID No 1 has a plurality of regulatory components so 
that it can perform its function as a promoter under the most widely 
varying conditions. The promoter according to the invention can be induced 

25 by the addition of pectin to the culture medium. That is advantageous as 
pectin is a readily available substance and thus an advantageous induction 
agent is available for the system according to the invention. 

The described nucleic acid sequences with a regulatory activity 
include such sequences which have occurred by virtue of modification, 

30 substitution, deletion or insertion or combinations thereof, which involve 
the same or better regulatory activity than the promoter, the terminator or 
the signal sequence. For that purpose it is also possible to include the 
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sequence in accordance with SEQ ID No 1 with further regulatory upstream 
sequences with activator and/or repressor functions. 

In accordance with the invention moreover such sequences are also 
considered, which with the claimed nucleotides or sequences have a 
homology of at least 80%, preferably at least 90% and in particular 95%, 
as long as they also have comparable activity. 

Preferably the sequence in accordance with SEQ ID No 1 is afforded 
in the form of an expression system or an expression cassette for the 
expression of proteins and peptides. The expression cassette according to 
the invention includes in its simplest form a regulatory sequence as is 
shown in SEQ ID No 1 or a part thereof which is active as a promoter, an 
insertion cloning site into which the polynucleotide can be cloned for the 
protein to be expressed, and the nucleotide sequence in accordance with 
SEQ ID No 2 

GCGTCTCT TTTTA I I I I I I I I I I I I I I I TTATTAACGT GAAGAAGATA 
AGGGAAGTCT TCAATGCGGT TCTGAATGGT TGATCCATTT CGATACCTCG 
GGGACTTCCT TTGAATATAT TCTGAGAGTA TGACAGTTGG TTTTCTTTCT 
TTCTTTCTAT TG I I I I I GTT TTTATG G A A A TATAGCTTTG ATGATTTAGG 
ATA I I I I I I G TAGTGAACCA ATACATGCTT GATTAATATA CGTACGAGGT 
GGGCATTCTA CTCTCATTAT TGGTGTTTTA TTGGAGGAA AAATTAAATC 
TAGGAGTATC GTTTAGAGCG CGAACGTAAT ATCCATGTTC TTCTCTTTGA 
AGAGGTCCCA CCATTGCTTC CCAGATGCC AGCATTCTTC CATGATATTT 
TGCGCTTGTT TTGCACTGGT GACACCCTTT CGAACCAAAG ATGTCAAGTG 
CTGCTGATAC AACAACCTGT ATTCATACAA TTCTGGATCC ATCAGCTCAC 
AATCCACAGC TGAAGATACA GAAAATGATA CATGTCTCTG CAG 
which codes the terminator sequence region of the endopolygalacturonase 
gene from Kluyveromyces marxianus. That expression cassette can be used 
in many different ways. The insertion cloning site is a cutting location at 
which the sequence can be cut open and the polynucleotide for the desired 
protein or peptide can be ligated in. In that simplest form after induction 
upon expression the protein is intracellular^ produced and is not removed 
from the cell. It can then be obtained in per se known manner after 
opening the cell. This embodiment is suitable both for small peptides and 
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proteins which are unstable outside the cell and also for proteins which are 
generally intracellular^ located. 

In a further embodiment which is suitable in particular for proteins 
which are to be removed from the cell, use is made of an expression and 
5 secretion system according to the invention. That system includes in 
operative association the nucleic acid sequence in accordance with SEQ ID 
No 1 or a part thereof which is active as a promoter, the sequence in 
accordance with SEQ ID No 2 as a terminator and, between those two 
sequences, the signal sequence in accordance with SEQ ID No 3 
10 ATGT TATTCAGCAA CACCTTATTG ATCGCAGCAG CTAGTGCATT 
ATTAGCTGAA GCTTCTCCAT TGGAAAAGAG A 

for removal of the protein. In this embodiment culture is effected in per se 
known manner, wherein either in a continuous method the protein is 
continuously discharged into the medium and can be continuously obtained 

15 from the fermentation liquor or in a discontinuous method the cells are 
cultivated, harvested and then the protein can be obtained from the liquor. 

The expression cassette according to the invention is suitable both 
for the expression of suitable autonomously replicating plasmids and also 
for incorporation in yeast chromosomes by way of integrative vectors. 

20 A further subject of the invention is the plasmids pEPGl-1, pEPGl-2 

and pEPGsec which are described in greater detail with reference to Figures 
1 to 3 and which contain the expression systems according to the invention. 
Those plasmids are recombinant bacterial plasmids and can be used in the 
present form for amplification of the expression cassettes. The plasmids are 

25 contained in the micro-organisms DSM 12919, DSM 12920, DSM 12921 or 
DSM 12922* and are deposited therewith [*Deposited at DSMZ, 
Mascheroder Weg lb, 38124 Brunswick]. 

It is however preferred for the plasmids, after the desired 
polynucleotide has been ligated in for the expression of a peptide or 

30 protein, to be amplified in E.Coli, then to obtain the plasmids, to cut out the 
expression cassette with suitable restriction endonucleases for which cut 
locations are provided at the edges of the expression cassette, and to ligate 
the expression cassette into a yeast vector. The vectors usually contain 
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selection markers in order to be able to select successfully transformed 
cells in per se known manner. 

The plasmids can possibly be multiplied in E. coll and then put into 
Kluyveromyces marxianus or another Kluyveromyces strain or also another 
5 yeast strain. The transformation system used can be for example known 
plasmids based on the Kluyveromyces drosophilarum plasmid pKDl. 
Derivatives of that plasmid are suitable for use in Kluyveromyces marxianus 
and when using the expression system according to the invention result in 
effective expression and secretion of foreign proteins in the corresponding 
10 host. 

In another embodiment it is possible for the expression cassette 
including the polynucleotide to be expressed to be cut out of the plasmids 
according to the invention, when prepared as above, and brought into 
contact as linear or circularised DNA-strand as an integration cassette 

15 directly with yeast cells in order to be absorbed thereby. By virtue of the 
homology with the endopolygalacturonase gene, then in a part of the 
treated cells, the DNA is received into the corresponding chromosome by 
exchange with the endogalacturonase gene. The selection of successfully 
transfected yeast cells is effected in this embodiment by way of the 

20 differing utilisation of pectin. 

The expression cassette according to the invention is stably 
incorporated into chromosomes and, if the cells are cultured under 
optimum conditions, results in a good yield of the desired protein. The copy 
number of the system can be set in dependence on the nature of the 

25 peptide or protein to be expressed. As the endopolygalacturonase gene is 
only singly present in the chromosome set of the yeast, upon transfection 
or transformation with the expression system provided in accordance with 
the invention a copy of the expression vector is also present only for each 
successfully transformed cell. If a higher number of copies is desired, then 

30 sequences of a gene which is present in the chromosome set in a larger 
number of copies, for example for rDNA, is ligated in per se known manner 
to the ends of the expression cassette in order to cause a higher number of 
exchange events. 



In the latter case a marker is additionally also incorporated in per se 
known manner into the sequence so that the successfully transformed cells 
can be selected. Methods and markers which are suitable for same are 
known to the man skilled in the art and do not need to be discussed in 
greater detail here. 

The system according to the invention is highly variable. Thus for 
example only the sequence in accordance with SEQ ID No 1 or a part 
thereof which is active as a promoter can be combined with other nucleic 
acid sequences which provide further regulatory sequences and with a 
heterologous nucleotide sequence. The sequence in accordance with SEQ 
ID No 1 or a part thereof which is active as a promoter can be combined 
with the sequence of SEQ ID No 2 in order to provide a regulatory system 
which is homologous in Kluyveromyces marxianus and into which the 
polynucleotide for the protein to be expressed is inserted or it is possible for 
a system of SEQ ID No 1, SEQ ID No 2 and SEQ ID No 3 to be combined 
together with a gene to be expressed, which codes a desired protein, in 
order to produce a product which is to be discharged into the culture. As 
Kluyveromyces cells can be grown with many different C-sources and are 
not very demanding in respect of further nutrients and moreover are 
temperature-insensitive, this affords a highly effective system. Reliable 
expression of the foreign proteins is achieved by the regulatory sequence 
provided in accordance with the invention. 

In accordance with the invention, there is provided a system which 
makes it possible to use the type of yeast Kluyveromyces marxianus which 
is highly promising as a host by virtue of its remarkable physiological 
capacities. 

The system according to the invention is suitable for the expression 
of peptides, polypeptides, proteins and hybrid molecules including 
glycosylated proteins. 

Thus in a further embodiment the expression cassette may also 
include the complete sequence of the endopolygalacturonase enzyme or 
parts thereof, wherein a sequence for a desired protein is ligated in 
between the endopolygalacturonase gene and the terminator sequence. 
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Then, in this embodiment, upon expression, a hybrid is obtained, from 
which the endopolygalacturonase is separated in per se known manner. 

Set out hereinafter are some definitions for terms which are used in 
the description. 

Accordingly, an 'expression vector' is a DNA molecule which can be 
linear or of a ring shape and which contains a segment which codes a 
sequence for a protein or peptide which is of interest, which is operatively 
joined to regulatory sequences. Those regulatory sequences include at least 
promoter and terminator sequences. The expression vector can additionally 
contain selectable markers and further regulatory elements and must 
permit transmission and multiplication in host cells. Replication of the 
expression vectors can occur autonomously or by integration into the host 
genome. 

The expression 'DNA' or 'polynucleotide' includes polymer forms of 
deoxyribonucleotides and ribonucleotides of any length and any 
modification in single- and double-stranded form. 

The expression 'operatively joined' means that the individual 
segments are so arranged that they serve the intended purpose, that is to 
say they can inhibit transcription and can promote expression from the 
replication starting point to the termination sequence. 

The claimed sequences may have further short sequences which do 
not interfere with the biological activity of the molecule. In addition the 
claimed sequences also include allele variants of the sequence, that is to 
say alternative forms of the gene which have occurred due to mutation. 

The term 'protein or peptide' relates to a molecular chain of amino 
acids with biological activity. The proteins and/or polypeptides can be 
modified in vivo or in vitro, for example by glycosylation and 
phosphorylation. The term hybrid molecules is used to denote molecules 
which include both homologous parts and also heterologous parts, for 
example such proteins which include a combination of 
endopolygalacturonase or parts thereof with a foreign protein, or for 
example nucleotide sequences in which DNA from Kluyveromyces 
marxianus is combined with DNA from other micro-organisms. 



The expression cassette according to the invention is suitable inter 
alia for yeasts of the strains Kluyveromyces and Saccharomyces and is 
preferably used in yeast strains of the type Kluyveromyces maxianus van 
marxianus. A particularly preferred strain with particularly advantageous 
expression properties is Kluyveromyces maxianus van marxianus BKM Y- 
719 which was described by Siekstele et al 1999 (Yeast 15, 311 - 322 
1999)). 

A further subject of the invention is a method of producing a 
recombinant protein which is characterised in that a yeast cell is 
transformed or transfected with an autonomously replicating plasmid which 
includes an expression cassette according to the invention and a 
polynucleotide which codes a foreign protein, the yeast cell is cultured 
under conditions which are suitable for the expression of the foreign 
protein, and the protein is obtained. 

The subject of the invention is also a method of producing a 
recombinant protein which is characterised in that an expression cassette 
according to the invention is brought into a yeast cell where the expression 
cassette is incorporated into a chromosome, the cell is cultured and then 
the protein is obtained. Particularly preferably the expression cassette 
according to the invention is used as a module which permits the 
construction of episomal or integrative expression vectors which contain the 
regulatory sequences in accordance with SEQ ID No 1, No 2 and/or No 3. 

The expression system according to the invention is suitable for the 
expression of various heterologous proteins. Particularly preferably the 
system is used for the expression of HBVS-antigen (hepatitis B virus, 
surface antigen) and virus protein 1 from polyoma virus. Those proteins are 
antigen proteins and can be particularly advantageously used as vaccines. 

The invention is described by means of the accompanying Figures 
and the following Examples. 

Figures 1 to 4 show plasmids with the expression cassettes according 
to the invention and Figure 6 shows the primers used in Example 2. 

Figure 1 shows the plasmid pEPGl-1. That plasmid includes an 
expression cassette with the promoter according to the invention in 



i 
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accordance with SEQ ID No 1 (referred to as EPGlprom), an insertion 
cloning site which can be cut with BspTl, and the terminator sequence 
according to the invention in accordance with SEQ ID No 2 (referred to as 
EPGlterm). That expression cassette was ligated into the multicloning site 
5 of the plasmid pUC19. The insertion cloning site which can be cut with 
BspTl replaces the open reading frame of the endopolygalacturonase gene 
which was removed. 

Figure 2 shows the plasmid pEPGl-2. That plasmid contains the 
expression cassette according to the invention, which includes a sequence 

10 in accordance with SEQ ID No 1 with the nucleotides 572 to 1134 (referred 
to as EPGlprom) which is active as a promoter, an insertion cloning site 
which can be cut with BspTl, and a terminator sequence which includes the 
nucleotides 28 to 541 of SEQ ID No 2 (referred to as EPGlterm). The 
insertion cloning site replaces the open reading frame of the 

15 endopolygalacturonase gene inclusive of the sequences -1 to -12 and 1087 
to 1115 of that gene. 

Figure 3 shows the plasmid pEPGsec. That plasmid contains an 
expression and secretion system of the invention. The plasmid contains an 
expression cassette with a promoter in accordance with SEQ ID No 1 

20 (referred to as EPGlprom), a terminator in accordance with SEQ ID No 2 
(referred to as EPGlterm), a signal sequence in accordance with SEQ ID No 
3 (referred to as EOGllyd) and two cut locations in order to insert the 
desired nucleic acid sequence for the protein to be expressed. That 
expression cassette was ligated into the multicloning site of the plasmid 

25 pUC19. The open reading frame of the endopolygalacturonase gene was 
removed from positions 75 to 1084 and replaced by a linker with the 
cloning sites Eco 1471 and BpU 11021. 

Figure 4 shows the plasmid pUC19PG. A 2.198 kb Pstl-DNA- 
restriction fragment from a recombinant lambda GEM™-12-bacteriophage 

30 of a genomic gene bank of Kluyveromyces marxianus was ligated into the 
Pstl-restriction location of the multicloning site of the plasmid pUC19. 

Figure 5 shows the plasmid pUC19-PGla. A 2.735 kb Stul-Pvull-DNA- 
restriction fragment from a recombinant lambda GEM™-12-bacteriophage 
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of a genomic gene bank of Kluyveromyces rnarxianus was exchanged in the 
plasmid pUC19 for the 321 kb fragment. The bold part of the plasmid 
corresponds to the DNA fragment from K. rnarxianus while the thin-printed 
part represents the proportion of the plasmid pUC19. 
5 Figure 6 shows the primers in accordance with SEQ ID Nos 4 and 5, 

which are used for the cloning of promoter and signal sequence. 

Figure 7 shows the signal sequence in accordance with SEQ ID No 3 
and the associated signal peptide (with sequence for pre- and 
prepropeptide) of the endopolygalacturonase from Kluyveromyces 
10 rnarxianus. 

Four micro-organisms which contain the plasmids in accordance with 
Figure 1, Figure 2, Figure 3 and Figure 4 were deposited at the DSMZ in 
accordance with the requirements of the Budapest Treaty. E.coli pEPGl-1 is 
deposited under deposit number DSM 12919, E.coli pUC19PG is deposited 
15 as DSM 12920, E.coli pEPGseq deposited under the number DSM 12921 
and E.coli pEPGl-2 under the deposit number DSM 12922. 
Example 1 

Manufacture of the basic plasmid for the production of the expression 
casssettes 

a 

20 A Pstl-DNA-fragment with 2.198 kb, which contains the complete 

gene of the endopolygalacturonase (EPG1) with the regulatory sequences 
was inserted into the Pstl-cut location of the multicloning site of the 
bacterial plasmid pUC19. That construct is shown in Figure 4. Using in each 
case two opposite primers which at the 5'-end bore a recognition site for 

25 the restriction enzyme BstTl and at the 3'-end have homology with the 
promoter, signal and/or terminator sequence of the EPGl-gene, it was 
possible by means of PCR passing around the plasmid to produce a DNA 
fragment which after restriction with BspTl could be circularised with a 
ligase. When using different recognition sites for restriction enzymes at the 

30 S'-ends of the PCR primers, suitable linkers had to be ligated in for 
circularisation ligation or the primers contained additional sequences at the 
5'-end, which represented a common recognition site for the restriction 
enzyme. Depending on the respective choice of primers, defined areas of 



the EPG1 region can be deleted from the plasmid by restriction and 
subsequent circularisation. The recombinant deletion plasmids were 
amplified in Escherichia coli and served as a basis for the expression 
cassettes. By way of the various recognition sites for restriction enzymes in 
the multicloning site of the plasmid pUC19 the cassette can then be cut out 
and cloned into an episomal or integrative vector for a suitable yeast strain. 
The cassette can also be used directly as a Pstl-DNA-fragment for 
integration into a yeast host strain. 
Example 2 

Manufacture of a recombinant plasmid with sequence ID No 1, No 2 and No 
3 

In the bacterial basic plasmid pUC19 the 321 bp Pvull fragment was 
replaced by a Stul/Pvull DNA fragment of 2.735 kb which contains the 
complete endopolygalacturonase gene inclusive of the regulatory sequences 
-1142 to -1 and +1087 to 1595 from Kluyveromyces marxianus. The 
Stul/Pvull fragment was for that purpose isolated from a recombinant 
lambda GEM™-12-bacteriophage of a genomic gene bank of Kluyveromyces 
marxianus. A map of the plasmid produced: 'pUCig-PGla' is shown in 
Figure 5. That plasmid can be used in a similar manner to Example 1 for the 
production of expression cassettes. As the multicloning site was completely 
deleted by the cloning strategy with that plasmid and the Stul location in 
the upstream promoter region of the EPG1 gene was destroyed by ligation 
with the Pvull location in the pUC19, the Pvull site which has remained 
downstream in relation to the EPG1 gene and for example the unique Narl 
site in the lacZ-part of the pUC19 can be used for cutting out the cassette. 

By means of the PCR and using the primers shown in Figure 6 (SEQ 
ID No 4 and No 5) it is possible to generate a DNA fragment which contains 
the promoter region and the signal sequence in accordance with SEQ ID No 
1 and No 3. The degenerated rPEPG primer has in the 5' region a mismatch 
with respect to the EPG1 sequence in the region 1220 and thereby 
generates from the EPG1 sequence: CAGTTG a Pvull site (CAGGTG). That 
point mutation (transition) leads to an amino acid exchange insofar as a 
cysteine codon is changed from TGT into an arginine codon after CGT. That 



13 

exchange occurs however in the proportion of the endopolygatacturonase 
which is replaced by the reading frame of a foreign gene and therefore has 
no influence on the expression. The Stul/Pvull-PCR fragment can be 
inserted as a blunt end into the multicloning site of a corresponding vector. 
5 By way of the Pvull site it is possible 'in frame' to add open reading frames 
of genes whose expression product is to be removed from the cell. 
Example 3 

Manufacture of polyoma JCV MAJOR COAT PROTEIN VP1: 

For the expression of the JCV VPl-gene the expression cassette 
10 pEPGl-1 was used. The VPl gene was amplified by means of PCR using the 
primers: 

JC5 (SEQ ID No 6): 5TC AAGCTT AAGAATGGCCCCAACAAAAAGA 3' 
and 

JC3 (SEQ ID No 7): 5 ' GTAAGCTTA AG ATTAC AG C ATTTTTGTCTG 3' 

15 and cloned as a Bspll fragment into the expression cassette of pEPGl-1. 
The cassette was cleaved with Pstl and the DNA fragments were converted 
at their ends by means of T4 polymerase to blunt ends. 

The 'blunt ended' cassette was inserted into the Smal site of the 
Kluyveromyces vector pKDSU. The transcription direction of the VPl-gene 

20 is in that case opposite to transcription of the S.cerevisiae URA3 gene. 

The recipient (BKM Y-719 ura) was transformed with the expression 
construct and uracil prototrophic clones were isolated. The presence of the 
expression cassette with the reading frame of the VPl gene was checked by 
means of PCR and the above-indicated primers. Selected clones were 

25 cultivated in various liquid media. The yeast cells were harvested by 
centrifuging and broken up in lysis buffer (lOmM TRIS-HCI, pH 7.5, 0.01% 
TRITON x 100, ImM CaCI 2 , lOOmM NaCl with ImM PMSF) with micro-glass 
beads at 0°C. The glass beads were then sedimented by centrifuging (2000 
rpm) for 10 minutes. The supernatant material was arranged in a layer on a 

30 20% saccharose pad (in lysis buffer) and centrifuged for 2 hours at 4°C and 
35,000 rpm (Beckman L8-75 rotor SW41). The pellet was suspended in 
lysis buffer and arranged in a layer on a CsCI-gradient (1.24 to 1.38 g/ml 
CsCI). The centrifuging operation was effected at 4°C, 35,000 rpm for 16 
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hours in the SW41 rotor. 'Virus like particles' of the JCV VP1 of an average 
particle size of 45 nm were obtained from the fractions between 1.3 and 
1.34 g/ml CsCI. 
Example 4 

Manufacture of hepatitis B virus surface antigen (HBV s-antiqenl: 

The expression cassettes pEPGl-1 and pEPGl-2 were used for 
expression of the HBV s-antigen gene of subtype (AYW). The HBS gene was 
amplified by means of PCR using the primers: 

HB5 (SEQ ID No 8): 5'AGCCTTAAGATAATGGAGAACATCACATCAGG 3' 
and 

HB3 SEQ ID No 9): 5TGACTTAAGTTAAATGTATACCCAAAG 3' 
and cloned as BspTl fragment into the expression cassettes. The cassettes 
were cleaved with BamHl and the DNA fragments filled at their ends by 
means of Klenow polymerase to form blunt ends. 

The 'blunt ended' expression cassettes were inserted into the Smal 
site of the Kluyveromyces vector pKDSU. The transcription direction of the 
HBS gene in that case is opposite to the transcription of the S.cerevisiae 
URA3 gene. The recipient (BKM Y-719 ura) was transformed with the 
expression constructs and uracil prototrophic clones were isolated on 
selective medium. Whole DNA was isolated from selected clones and used 
for detection of the expression cassette by means of PCR. Positive clones 
were cultivated in liquid, synthetic and in complete media and used for the 
production of S-protein particles. Corresponding yeast cultures were 
harvested by centrifuging and broken up in a PBS buffer with ImM PMSF 
with micro-glass beads at 0°C. The glass beads were then sedimented by 
centrifuging (2000 rpm) for 10 minutes. The yeast membrane fraction was 
sedimented from the supernatant material by centrifuging for 30 minutes at 
14,000 rpm (J20 Rotor Beckman J2-21). HBV s-antigen sedimented with 
that fraction and was eluated by 0.5% Tween-20 in PBS from the fraction. 
After renewed centrifuging at 14,000 rpm the supernatant material was 
obtained and separated off by CsCI-density gradient centrifuging. Highly 
purified HBV s-antigen particles were isolated from the 1.2 g/ml CsCI 
fraction. 
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The yield of isolatable antigen, in cultivation in synthetic media, with 
the pEPGl-2 cassette, was twice as high as with the pEPGl-1 cassette. 
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CLAIMS 

1. A DNA sequence including the nucleotide sequence of SEQ ID No 

1. 

2. A yeast expression system containing in operative junction the 
nucleotide sequence of SEQ ID No 1 or a part thereof which is active as a 
promoter, an insertion cloning site and the nucleotide sequence of SEQ ID 
No 2 or a part thereof which is active as a terminator. 

3. A yeast expression and secretion system including in operative 
junction a sequence in accordance with SEQ ID No 1 or a part thereof which 
is active as a promoter, the nucleotide sequence of SEQ ID No 3, an 
insertion cloning site and the nucleotide sequence of SEQ ID No 2 or a part 
thereof which is active as a terminator. 

4. Plasmid pEPGl-1 containing a yeast expression cassette according 
to claim 2 deposited under the deposit number DSM 12919. 

5. Plasmid pEPGl-2 containing a yeast expression cassette according 
to claim 2 deposited under the deposit number DSM 12922. 

6. Plasmid pUC19PG deposited under the deposit number 12920. 

7. Plasmid pEPGsec containing a yeast expression cassette according 
to claim 3 deposited under the deposit number DSM 12921. 

8. An expression vector containing in operative junction a promoter 
with the sequence of SEQ ID No 1 or a part thereof which is active as-a 
promoter, a polynucleotide which codes a foreign protein, and a terminator 
sequence. 
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9. An expression vector according to claim 8 which in addition also 
includes a signal sequence between promoter and polynucleotide. 

10. An expression vector according to claim 9 characterised in that 
the signal sequence is a sequence in accordance with SEQ ID No 3. 

11. An expression vector according to one of claims 8 to 10 
characterised in that the polynucleotide codes an antigen protein or 
peptide. 

12. An expression vector according to claim 11 characterised in that 
the polynucleotide codes a hepatitis B surface antigen, VP1 from polyoma 
virus or protein A from Staphylococcus. 

13. An expression vector according to one of claims 8 to 12 
characterised in that the vector is an integrative or episomal vector. 

14. An expression vector according to one of claims 8 to 12 
characterised in that the vector is a plasmid replicatable in yeast. 

15. A host cell that has been transformed with an expression vector 
or a plasmid according to one of the preceding claims. 

16. A host cell according to claim 14 characterised in that it is a cell 
of the type Kluyveromyces marxianus. 

17. E.coli pEPGl-1 deposited under the deposit number DSM 12919. 

18. E.coli pUC19PG deposited under the deposit number DSM 12920. 

19. E.coli pEPGseq deposited under the deposit number DSM 12921. 

20. E.coli pEPGl-2 deposited under the deposit number DSM 12922. 
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21. A method of manufacturing a recombinant protein, characterised 
in that a yeast cell is transfected or transformed with a plasmid which 
includes the expression cassette according to one of claims 2 and 3 and a 
polynucleotide which codes a foreign protein, the yeast cell is cultured 
under conditions which are suitable for the expression of the foreign 
protein, and the protein is produced. 

22. A method of manufacturing a recombinant protein characterised 
in that an expression cassette according to claim 2 or claim 3 is put into a 
yeast cell where the expression cassette is incorporated into a 
chromosome, the cell is cultured and then the protein is obtained. 

23. Use of a DNA sequence according to SEQ ID No 1 as a promoter 
for the expression of foreign proteins in yeast cells. 
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Expression cassete with BspTI insertion cloning site I /.gated into the multicloning site of the 
plasmid pUC19. The open readvngframe of the endopolygalacturonase gene has been 
removed. 
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PCT/EPOO/08662 

WO 01/20005 2/7 

Expression cassete with BspTI insertion cloning site legated into the multicloning site of the 
plasmid pUC19. The open read »:ng frame of the endopolygalacturonase gene including the 
sequences of -1 to -12 and 1087 to 1115 has been removed and replaced by a BspTI cloning 
site. 

pEPGl-2 

-32 cacagatuu gctcagaiAC(i^ias^ aa ATG...ORF EPG...TAA gc gtctcnxn wmnm rattti)Tai taacgigaag aag 



, cacaeariaa gctcagaaaCTTAAG Tat uacgtgaag aag... 

t 

BspTI 



Fig. 2 
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Expression cassette with secretion signal sequence ligates into the multicloning site of the 
plasmid pUC19. The open reading frame of the endopolygalacturonase gene was removed 
from position 75 to 1084 and replaced by a linker with the cloning sites Ecol471 and 
BpUU021. 

pEPGsecl 



K R Stop 
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K R Stop 
.AAG AGg cct agt gag cct tct gcT AAG C G TCT. 
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Fig 3 
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Plasmid pUC19PG 

A 2.198 kb Pstl DNS restriction fragment from a recombinant LambdaGEM™~12 of the 
Kluyveromyces marxianus gene bank was ligated into the Pstl restriction site of the 
multicloning site of the plasmid pUC19. 
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Plasmid pUC19PGla: 

A 2.735 kb Stul-Pvull DNS restriction fragment from a recombinant LambdaGEM™-12 of the 
Kluyveromyces marxianus gene bank was exchanged in the plasmid pl)C19 for the 321 b 
Pvull fragment. 

The bold part corresponds to the DNS fragment from K. marxianus. The thin-printed part 
represents the proportion of the plasmid pUC19. 
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PCR primers for blunt end cioning of promoter and signal sequence of the EPG1 gene from 
Ktuyveromyces marxianus 



rPEPG: 5TTAACAGCTGTCTCTCTTTTCCAATGGAGAAGC 3' 
PvuW 

vPEPG:5'TTAAGAGGCCTGTCCGATTATAAACTTGCGGC 3* 



Fig. 6 
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Fig 2: Signal sequence and signal peptid of endoppolygalacturonase from Kluyveromyces 
marxianus 
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Fig. 7 
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Date: 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patent issued thereon. 
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Citizenship : Lithuania 
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LEOPOLD DOHNER 
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<110> TAD Pharmazeutisches Werk 



<120> 



Regulatory sequences and expression cassettes for yeasts 



<130> 



TM5071 



<140> 
<141> 



TM5071 
1999-09-10 



<I60> 
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<170> 



Patentln Ver. 2.1 



<210> 
<211> 
<121> 
<213> 



1 

1146 
DNA 

Kluyveromyces marxianus 



<400> 



gaggcctgtc cgattattaa acttgcggca cccgagtttg tgaccttcga cgacatgttt 60 
tatttccaca ccgtagctac actttctatg tagtaagtag gtagtatgga tggtagctag 120 
tagaaactaa acgaaacgaa ataaatgtga aatgttagac gtaaagggga ggggaagggal80 
agggggcggc ggagagacat gccaagccat gccatttcat ggcatggcat gtcaagggat 240 
actgcatgca tgcatgcata ctttaccaat agcaaagtaa attgctttct tcccccattt 300 
gaaactattc cacctcaatc catcttttct ataatgggta tcaccgatct catgtgttct 360 
aataatgctg caggcaacaa caaatcttaa aggcaacttg gaatgtaatt tggttaatga 420 
tagatatcaa acagcaatgg tgggctccaa ccgcatggat atgctcacct tattatccgg 480 
aattgttgtt ccgcaggaaa aaaaaaaaac ctcgaaccag atattaatta tcctatcatt 540 
actgcgtaca aaacccggga acggttaacc tgcagcagcc gttttgctta cagttctcat 600 
gcacaatcag ccagattttg caatagtatt aacttagaat taaggcaaca tctttggata 660 
tgcatgtaga gtaagtcgtt cgaaaccatt attattatta ttattattat tattattatt 720 
attattatta ttattagtat tattgaaatt gttattgttc ttagtttcac tactattatt 780 
attcatattc atgttattga catcgccgaa cgaccagect ccataccgat tagacaggat 840 
ctcaaacgtq ggctccagag ctcacacatt atgctaaata actatctact gtaacagcta 900 
cagaaaaaaa actataaaag agcgagggat aaaccactct cttgtgaatc aggatcagta 960 
ggtaactcat aaaccttctt cttttctctc aaaatatcaa ataacagtag tatcaacaacl020 
gatatcgaat aatactaact actacaacag taggaacagt aacgacaacg acaacgatag 1080 
taacgacaat aacgacacca acaaacaaca ggaacacaga ttaagctcag aaacaaaaaall40 
aaaaaa 1146 

<210> 2 

<211> 541 

<212> DNA 

<213> Kluyveromyces marxianus 



<400> 2 



gcgtctcttt ttattttttt tttttttttt attaacgtga agaagataag ggaagtcttc 69 
aatgcggttc tgaatggttg atccatttcg atacctcggg gacttccttt gaatatattc 120 



•■'is H-l: u tt'. ii « 



20 

tgagagtatg acagttggtt ttctttcttt ctttctattg tttttgtttt tatggaaata 180 

tagctttgat gatttaggat attttttgta gtgaaccaat acatgcttga ttaatatacg 240 

tacgaggtgg gcattctact ctcattattg gtgttttatt ggagggaaaa attaaatca 300 

ggagtatcgt ttagagcgcg aacgtaatt ccatcttctt ctctttgaag aggtcccacc 360 

attgcttccc agatagccag cattcttcca tgatattttg cgcttgttt gcactggtga 420 

caccctttcg aaccaaagat gtcaagtgct gctgatacaa caacctgtat tcatacaatt 480 

ctggatccat cagctcacaa tccacagctg aagatacagca aaatgataca tgtctctgca 540 

9 541 

<210> 3 
<211> 75 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 3 

atgttattca gcaacacctt attgatcgca gcagctagtg cattattagc tgaagcttct 60 
ccattggaaa agaga 75 

<210> 4 
<211> 33 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 4 

ttaacagctg tctctctttt ccaatggaga age 33 

<210> 5 
<211> 32 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 5 

ttaagaggee tgtccgatta taaacttgcg gc 32 

<210> 6 

<211> 30 

<212> DNA 

<213> Polyomavirus sp. 

<400>6 

tcaagcttaa gaatggcccc aacaaaaaga 30 

<210> 7 

<211> 30 

<212> DNA 

<213> Polyomavirus sp. 



<400> 1 

gtaagcttaa gattacagca tttttgtctg 



30 



<210> 8 

<211> 32 

<212> DNA 

<213> Hepatitis B virus 

<400> 8 

agccttaaga taatggagaa catcacatca gg 

<210> 9 

<211> 27 

<212> DNA 

<213> Hepatitis B virus 



<400> 9 

tgacttaagt taaatgtata cccaaag 
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